Speaking style dependency of formant targets

نویسندگان

  • Akiko Amano-Kusumoto
  • John-Paul Hosom
  • Alexander Kain
چکیده

Previous work on formant targets has assumed that these targets are independent of the speaking style. In this paper, we estimate consonant and vowel targets in a database of “clear” and “conversational” speech, using both style-independent and style-dependent models. The test-set errors and clustering of the estimated target values indicate that for this corpus, formant targets depend on the speaking style. Vowel classification accuracy was then tested on estimated target values and compared with classification based on observed formant values. Tokenbased style-independent classification shows greater accuracy for conversational speech (82.19%) than observed-value classification (73.97%).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier

The paper is aimed at determination of formant features (FF) which describe vocal tract characteristics. It comprises analysis of the first three formant positions together with their bandwidths and the formant tilts. Subsequently, the statistical evaluation and comparison of the FF was performed. This experiment was realized with the speech material in the form of sentences of male and female ...

متن کامل

A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech

A quantitative model of coarticulation is presented that accurately predicts formant dynamics in fluent speech using the prior information of resonance targets in the phone sequence, in absence of actual acoustic data. Realistic formant undershoot (reduction) and “static” sound confusion is produced naturally from the model for fast-rate speech in a contextually assimilated manner. The model de...

متن کامل

Formant Frequencies of Dutch Vowels in a Text, Read at Normal and Fast Rate*

Speaking rate is thought to affect the spectral features of vowels. Target-undershoot models of vowel production predict more spectral reduction and coarticulation of vowels in fast-rate speech than in normal-rate speech. To test this prediction, a meaningful Dutch text of about 850 words was read twice by an experienced newscaster, once at a normal speaking rate and once as fast as possible. A...

متن کامل

Rhythm and formant features for automatic alcohol detection

Two speech feature sets, RMS rhythmicity and formant frequencies F1-F4, are analyzed for their ability to distinguish alcoholized from sober speech. We describe the statistical framework based on the Alcohol Language Corpus (ALC), including other factors such as gender, age and speaking style, and its application to our case. Rhythm features are calculated using a new method based solely on the...

متن کامل

Unstressed vowels in non-native German

Vowel reduction and deletion are prominent correlates of stress in German and some preliminary investigations have suggested that this constitutes an area of difficulty for nonnative speakers. This paper explores the production of vowels in unstressed syllables by learners of German, focusing especially on the acoustic properties duration and formant structure. It is shown that the realization ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010